- Title
- Improved similarity search for large data in machine learning and robotics
- Creator
- Walker, Josiah
- Relation
- University of Newcastle Research Higher Degree Thesis
- Resource Type
- thesis
- Date
- 2017
- Description
- Research Doctorate - Doctor of Philosophy (PhD)
- Description
- This thesis presents techniques for accelerating similarity search methods on large datasets. Similarity search has applications in clustering, image segmentation, classification, robotic control, and many other areas of machine learning and data analysis. Database and data-analysis applications of similarity search have traditionally focused on search throughput, but in online classification and robotic control it is also important to consider total memory usage, scalability, and sometimes the construction cost of the search structures. This thesis presents a locality-sensitive hash (LSH) code generation method that has a lower computational and technical cost than baseline methods while maintaining performance across a range of datasets. The method has an exceptionally fast search structure construction time, which makes it suitable for accelerating even a small number of queries when no search structure has been built beforehand. A simplified boosting framework for locality-sensitive hash collections is also presented. Applying this framework speeds up existing LSH boosting algorithms without loss of performance. Finally, a simplified boosting algorithm is given that improves performance over a state-of-the-art method while also being more efficient.
- Subject
- nearest neighbour search; cover trees; approximate search; locality sensitive hashing; boosting; learning to search; similarity and distance learning; big data; large scale learning; reverse nearest neighbour search
- Identifier
- http://hdl.handle.net/1959.13/1333823
- Identifier
- uon:27161
- Rights
- Copyright 2017 Josiah Walker
- Language
- eng
- Full Text
File | Description | Size | Format
---|---|---|---
ATTACHMENT01 | Thesis | 3 MB | Adobe Acrobat PDF
ATTACHMENT02 | Abstract | 287 KB | Adobe Acrobat PDF